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Abstract — Cooperative spectrum sensing is a robust strategy 
tliat enhances tlie detection probability of primary licensed users. 
However, a large number of detectors reporting to a fusion center 
for a final decision causes significant delay and also presumes 
the availability of unreasonable communication resources at 
the disposal of a network searching for spectral opportunities. 
In this work, we employ the idea of sequential detection to 
obtain a quick, yet reliable, decision regarding primary activity. 
Local detectors take measurements, and only a few of them 
transmit the log likelihood ratios (LLR) to a fusion center in 
descending order of LLR magnitude. The fusion center runs 
a sequential test with a maximum imposed on the number of 
sensors that can report their LLR measurements. We calculate 
the detection thresholds using two methods. The first achieves 
the same probability of error as the optimal block detector. In 
the second, an objective function is constructed and decision 
thresholds are obtained via backward induction to optimize 
this function. The objective function is related directly to the 
primary and secondary throughputs with inbuilt privilege for 
primary operation. Simulation results demonstrate the enhanced 
performance of the approaches proposed in this paper. We also 
investigate the case of fading channels between the local sensors 
and the fusion center, and the situation in which the sensing cost 
is negligibleQ 

Index Terms — Cognitive Radios, Cooperative Spectrum Sens- 
ing, Detection delay. Sequential detection. 

I. Introduction 

Cognitive radio is an emerging technology aimed at solving 
the problem of spectrum under-utilization caused by static 
spectrum allocation [1|. The technology allows a cognitive, 
also called unlicensed or secondary, terminal to make efficient 
use of any available spectrum at any given time. This involves 
the detection of the activity of licensed, or primary, users 
so that the secondary operation does not interfere with the 
primary networks and disrupt their services. This detection 
task may be extremely difficult due to a large array of factors 
including, inter alia, the wide variety of primary networks and 
the uncertainty about the propagation conditions between an 
active primary terminal and a secondary sensor attempting to 
detect primary activity. If the channel between a primary trans- 
mitter and a secondary sensor is in deep fade, the sensor would 
falsely decide that the probed channel is vacant. Secondary 
transmission on this channel would then cause interference on 
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the primary link. Relying on one sensor to detect the presence 
of primary operation is therefore highly unreliable [P. 

Enhancement of sensing reliability requires a system of 
spatially distributed multiple sensors cooperating together in 
order to mitigate the impact of channel uncertainty provided 
that the channels between them and the primary transmitter 
are independent. Many papers have shown improvement in 
spectrum sensing capabilities through cooperation between 
individual cognitive users (see, e.g., |3|, [4J, [5|, [6|, L2I). 
However, in order to reap the benefits of distributed detection 
and cooperative diversity, efficient schemes are needed to 
combine the data obtained by the local sensors |8|. There 
is typically a fusion center at which the final decision is 
made. Whether the fusion center receives the raw unprocessed 
data from local sensors or summary messages obtained after 
some local processing, there are challenges concerning the 
transfer of data to the fusion center It may be assumed, for 
instance, that there are orthogonal channels available for the 
transmission from the local detectors to the fusion center 
In the context of cognitive radio, this assumption might not 
be practical given that the main objective of the cognitive 
users is to find communication resources. Moreover, if the 
duration of data transmission between the local sensors and 
the fusion center is large, then the primary user may switch 
back to activity, thereby depriving the secondary network of a 
communication opportunity. In the cognitive radio setting, not 
only many terminals are needed for robust detection of primary 
activity, but also this detection should be done in the quickest 
way possible and under very strict constraints regarding the 
communication between the cognitive detectors and the fusion 
center. 

Sequential detection is known to reduce the time of de- 
tection for a certain specified detection reliability (see, e.g., 
||9l , ifTOl . ifTTl ). In contrast with block detection that operates 
on a fixed predetermined number of samples, the number 
of samples in sequential detection is a random variable. In 
addition, for the case of binary hypothesis testing, sequen- 
tial detection requires two thresholds and not only one for 
operation. This allows for the decision to continue acquiring 
more samples to satisfy some detection performance measures. 
There has been considerable interest in applying sequential 
analysis to distributed or decentralized detection problems 
(see, e.g.. El, lEl, Ql, fla. Ell, Inl). According to 
[118 1. distributed implementation of sequential detectors can 



have several forms. For instance, the sensors may forward 
their local decisions to the fusion center, which then runs 
a sequential test to obtain a global decision. On the other 
hand, the sensors themselves may perform sequential tests and 
cooperate together with or without a fusion center In [T9l, 
the problem of distributed sequential detection in the presence 
of communication constraints is studied. An algorithm is 
developed for optimal rate or bandwidth distribution among 
detectors under a fixed bandwidth constraint. The work is 
based on the sequential probabiUty ratio test (SPRT) of Wald 

13. 

In the context of cognitive radio, distributed sequential 
detection has been investigated, for example, in ll20l . 11211 . 
ED, II23I . In the scheme proposed in II2TI . each sensor 
computes the log-likelihood ratio (LLR) for its observations 
and reports it to the fusion center over a perfect reporting 
channel. The LLR's are accumulated sequentially at the fusion 
center till their sum is found sufficient to cross either of two 
predefined thresholds. The thresholds are obtained on the basis 
of the desired global detection and false alarm probabilities. 
The authors also investigate the case of sequential detection 
with model uncertainties. In ll22l . the detectors are divided into 
different sets according to the local signal-to-noise (SNR) of 
the sensing channel. The set with highest SNR sends to the 
fusion center first, followed by lower SNR until the fusion 
center can make a final decision. A doubly sequential test 
is proposed in [231 in which both the local detectors and 
the fusion center operate sequential tests. The detectors have 
thresholds computed using Wald's approximations |9 |. When 
the first sensor reaches a decision, it conveys its decision to 
the fusion center which is running a second sequential test for 
a global decision on channel availability. The authors make 
several suggestions pertaining to the termination criterion at 
the fusion center For example, the fusion center stops when 
it has received a specified number of decisions in favor of a 
particular hypothesis. 

In addition to distributed sequential detection, we make use 
of the idea of ordered transmissions provided in 1*241 for the 
purpose of energy efficient signal detection in wireless sensor 
networks (see also ||251 ). By ordered transmissions, it is meant 
that the local sensor with the most reliable current measure- 
ment sends first its LLR value to the fusion center In order 
for a sensor to know that it has the most reliable LLR value, 
there is no need for extensive information exchange. A timer 
backoff mechanism is suggested in |24| to resolve this issue. 
Note that the sensor selection criterion is its instantaneous 
measurement quality and not the average SNR as in 1221 for 
example. We discuss the scheme in detail in this paper. Suffice 
it now to mention that the proposed scheme in ll24l achieves 
the minimum average probability of error attained when all 
the sensor LLR's are used for a decision, and at the same 
time lowers the average number of transmissions needed to 
reach a decision. The scheme in (24^ is utilized in 1261 for 
sequential fusion in wireless sensor networks. It is certainly 
beneficial for a cognitive radio setting because it enhances the 
secondary throughput by quickly seizing the available spectral 



opportunities. 

In this paper, in order to reach a global decision at the 
secondary fusion center as quickly and reliably as possible, 
we adopt the ordered transmission scheme of ll24l . However, 
we impose a strict constraint on the maximum number of 
transmissions to the fusion center We therefore extend the 
scheme to work for the case when the fusion center is to 
make a decision after receiving a maximum of K observations 
from a total of M local sensors. Assuming a slotted primary 
network where the primary user may switch activity every 
fixed amount of time, only few observations can be used 
within a time slot to make a decision regarding the availability 
of a transmission opportunity. In other words, the decision 
regarding primary activity must be made soon enough to allow 
for channel access, and under the reasonable assumption of a 
narrowband cognitive control channel, only a few sensors can 
actually participate in the detection process. 

Having imposed this constraint of a maximum number of 
reports from the local sensors, we devise a scheme where 
the decision thresholds employed at the secondary fusion cen- 
ter are computed using a dynamic programming framework. 
Given the LLR observations sent to the fusion center, the 
posterior probability of the channel being free is calculated 
and compared to the thresholds for a decision. We derive 
expressions for the UkeUhood functions of ordered LLR's and 
use these in both proposed schemes. For the dynamic pro- 
gramming scheme, a weighted sum of primary and secondary 
throughput objective function is constructed that also accounts 
for transmission costs, lost transmission opportunities and a 
penalty for collision with primary transmission. It is important 
to note that, in |27| . dynamic programming is also employed 
to get the decision thresholds. However, it is assumed that 
all cognitive sensors would be able to report their energy 
measurements to the fusion center, where each detector senses 
one of the multi-band channels. 

In summary we make the following contributions in this 
paper. We use a cooperative sequential detection framework 
to address the tradeoff between throughput and detection 
reliability in cognitive radio systems taking into account the 
strict time limitation during which the decision must be 
made, which also imposes a limit on the number of sensors 
involved in the decision process. We employ an ordered LLR 
transmission scheme modifying the method of threshold deter- 
mination in (231 to account for the constraint on the number 
of reporting sensors. We also use dynamic programming to 
obtain the decision thresholds. The proposed method improves 
the throughput performance, which is the desired goal in the 
context of cognitive radios. We prove that if sensing has no 
cost, the optimal sequential decision is either to declare that the 
primary is idle or to continue sampling without ever deciding 
that the primary is active except possibly at the last decision 
stage. Finally, we address the impact of fading on the reporting 
channels linking the cognitive detectors to the fusion center. 

The rest of the paper is organized as follows: Section HI] 
presents the system model of cooperative spectrum sensing in 
a cognitive radio network. Section [III] discusses the previous 



work on ordered transmissions scheme, then discusses our 
proposed extension derivation. Section |IV] and |V] describe 
the proposed dynamic programming scheme. The effect of 
fading on the channels between the local detectors and the 
fusion center is explained in Section |VI] In Section IVIII we 
provide simulation results and elaborate on the difference in 
performance when both schemes are used. We conclude the 
paper in Section IVIIII 

II. System Model 

We consider a slotted primary system as shown in Figure 
[H where the primary activity, whether on or off, does not 
change during the time slot duration, Tg. Primary activity 
switches independently from one slot to the next. There are 
M cognitive sensors which take a number of measurements 
at the beginning of each time slot and compute a function of 
these measurements. A maximum of K sensors among the M 
sensors forward the results sequentially to a fusion center at 
which the final decision regarding primary activity is taken. 
We consider binary hypothesis testing at the fusion center with 
the following two hypotheses: 

Hq: Sensed channel is free ; Hi: Sensed channel is busy 

The prior probabilities of each, denoted by ttq and (1 — ttq), 
respectively, are assumed to be known. Observations from 
different sensors are conditionally independent given either 
hypothesis but can be non-identical. Let Xi (n) be the received 
signal at the i*'' sensor at instant n, where i — 1,2, M. At 
each sensor i, Xi (n) are independent given each hypothesis 
and are identically distributed. Assume that a total of N 
samples are taken over a time duration tn. Under the two 
hypotheses, Xi (n) is given by 



N 



fXi{n){Xn I Hi) 



fXi(n){Xn \Hq) 



(3) 



This is the quantity that is computed by the local detector and 
reported to the fusion center sequentially as explained below. 

Defining the local signal-to-noise ratio (SNR) as 7^ = 

a 

we can compute the LLR at the i sensor as 
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is a chi- 



=1 

- log (1 + 7,)- 

Note that under H^ the summation ^ 
square distribution with N degrees of freedom. This is the 
same for the summation '^^=1 l^li+'-l ) under Hi. That is, 
the likelihood functions of Yi given Hq or Hi are shifted and 
scaled chi-square distributions with N degrees of freedom. 

The LLR values are quantized and coded using a fixed 
number of bits. As in the case of the local signal power, the 
quantization noise is assumed to be negligible. The informa- 
tion exchange between the cognitive sensors and the fusion 
center occurs on the common control channel. We initially 
assume that the transmission of the LLR values to the fusion 
center is perfect, then we consider the effect of fading in 
the channels between the local sensors and the fusion center 
in Section [Vll We assume that the procedure of seizing the 
control channel and sending the observation of one sensor 
requires an amount of time which we denote as r. Figure 
[U illustrates the time durations tn, t and r^. 



HQ:X,{n)=W,{n), n^l,2,...,N 
Hi:X,{n)=S^{n) + W,{n), n^l,2,...,N ^' 

where Wi is additive white Gaussian noise (AWGN) having 
the same noise power, cr^, at all sensors. Without loss of 
generality, the received primary signal Si (n) is assumed to 
be real zero-mean Gaussian random variable. The conditional 
probability distributions of Xi (n) given Hq and Hi are de- 
scribed by fx,{n){xn I Ho) and fx,{n){xn \ Hi), respectively, 
such that 

/x,(„)(x„ I Ho) - NiO,a^) 
fx,{n){xn I Hi) ^ N{Q,<jl+a^) 

where cr^ is defined as the average received primary signal 
power at the i*'' local sensor, and is assumed to be fixed over a 
time slot and to change relatively slowly over time. The values 
of (T^ are assumed to be known at the local sensors and are 
transmitted periodically after being quantized and coded to the 
fusion center on a low-rate common control channel that is 
not on the band being sensed. The quantization noise effect is 
assumed to be negligible. The LLR at the i*'' sensor is defined 
as 
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Fig. 1. The primary time slot lias a duration of Ts units of time. Over tjv, 
each one of M cognitive sensors takes A' samples at the beginning of the 
time slot. Afterwards, there are K mini-slots, each of duration r, over which 
one LLR observation is transinitted from one cognitive sensor to the fusion 
center over a low-rate common control channel. 

Since the transmission of each computed LLR value takes 
some time r, eliciting another measurement from the sensors, 
though improving the reliability of detection, wastes a duration 
of T from the potential secondary transmission opportunity in 
case the primary is off. This causes a decrease in secondary 
throughput. In other words, we have a reliability-throughput 
tradeoff [28 1 which we can control by allowing only the subset 
of the cognitive detectors with the most reliable observations to 
transmit their LLR's to the fusion center This is implemented 



by having two thresholds at the fusion center. The LLR with 
maximum magnitude is transmitted first to the fusion center. 
A decision can be made, but if the metric is between the two 
thresholds, the second highest LLR in magnitude is transmitted 
to the fusion center and is combined with the first, and then a 
decision is attempted again. This continues until a decision in 
favor of Hq or Hi is made, or K LLR's are accumulated at 
the fusion center. If k sensors, 1 < A; < if , are probed before 
a decision is reached, the time to make a decision is tn + kr. 
Since the slot duration is r^, this leaves Tg — ~ kr for 
transmission. Given the durations rjy, t and r^, it is obvious 
that K must satisfy the inequality Tg — T]\[ — Kt > 0. 

Some final notes are in order regarding the operation of the 
secondary network. 

(a) The LLR values can be quantized to a few bits only with 
no considerable performance degradation when compared 
with the case where unquantized LLR's are used ||29l . 

(b) We assume a single hop network in which any broadcast 
by a secondary node is heard by the fusion center and all 
other nodes. This assumption is necessary such that when 
a node seizes the common channel to transmit its LLR, 
all other nodes become aware of its transmission. 

(c) We assume that the sensors have nearly synchronized 
clocks. This issue and its practical aspects are discussed 
extensively in ll30l . 

III. Ordered Transmissions Scheme 

In this section we describe the scheme in |24|. A backoff 
timer is set at each of the M sensors according to the 
magnitude of the locally computed LLR. Specifically, the 
timer is decreased as the absolute value of the sensor's LLR 
increases. Consequently, transmissions proceed such that the 
most confident observations and informative measurements are 
sent first to the fusion center. The fusion center then compares 
the accumulated sum of the received LLR's to two thresholds 
and makes one of three decisions accordingly; declare Hq, 
continue taking the next ranked observations, or declare Hi. 
Let yl™] denote the LLR of rank m = 1,2,...,K received 
during the m*'* reporting mini-slot, where m = 1 is the highest 
rank. For 1 < fc < A', the decision strategy is expressed as 



If 



k 

m—l 



> t 



(H) 



Declare Hq 
Continue 
Declare Hi 



(5) 



Note that the thresholds are generally time-dependent, i.e, their 
values depend on the stage index k. In order to force a stopping 



at k 



K, 



(H) 
K 



When the statistics evidently favor one hypothesis over the 
other, the decision is made and LLR transmissions from the 
local sensors stop. Otherwise, more observations are taken 
possibly till the end of the primary time slot. In ll24l . the 
data-dependent thresholds t^^^^ and t^^-* are given by 



log ( 



-)-{M-k) 

1 - TTo 

) + {M-k) 



(6) 



■ 1 - TTo ' 

where M—kis the number of sensors that have not transmitted 
their LLR's yet at the fc*'' stage, and \Y^''^\ is the absolute 
value of the LLR received at stage k. However, if this scheme 
is forced to take a decision in favor of either Hq or Hi at stage 
K, then it should work only for the case K — M, as can be 



M 



The transmission is 



seen by the fact that only t\j ~ t 

ordered in terms of the absolute LLR value. That is, < 
Fl'^ljjVm > k. Hence, is greater than any LLR value 

transmitted later to the fusion center It can be shown that 
this choice of thresholds ensure that this scheme has the same 
average probability of error as maximum a posteriori (MAP) 
procedure where all the LLR's are summed and compared to 
log (ttq/ (1 — TTo)) ||241 . The average probability of error when 
the guessed hypothesis, H, is not the true one is given by 



Pe = {l-^Q)Pr[H = HQ\Hi 



+ ^0 Pr [H = Hi\ Ho) (7) 

The advantage of this system is that the average number of 
transmissions needed to reach a decision is about half the total 
number of the sensors communicating with the fusion center 

Here, and in our ICASSP paper fJTl, we consider extending 
the scheme in [,24J to work for K < M case such that we only 
process the K LLR's with highest magnitudes out of the M 
LLR's. Given the sequence of observations y^^l = yi,Y^'^^ ~ 
1/2, --Y^^^ — UK, the optimal MAP block detector when the 
highest in magnitude K out of M LLR values are used has 
the following decision rule: 
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< log 

Ho 



1 - TTo 



(8) 



YW..YW \h 



(yi, J/2, ■■yK\H) is the joint density func- 
tion of the magnitude-ordered LLR's given hypothesis H. 

In this section, we consider only the case of identical 
sensors. Let the probability density function of LLR value 
y under hypothesis H be given by /y {y\H). Assuming the 
statistical independence of LLR's at various sensors, the joint 
density functions in ^ can be given by 1321 

■I'yw..yw\h {yi^y2,-yK\H) = 

/y {yi\H) ../y {yK\H) (Pr{|r| < \yK\}) 

(See Appendix lAl for details). 

In order to perform sequential detection, we compute the 
accumulated sum at the fusion center at stage k, which is 

7r(2/m I Hi 



M-K 



(9) 



given by 



k 

E 



Ids 



/y(2/m I Ho) 



However, this accumulated 



sum can be replaced by X]m=i f™' because as shown in y < is used. Therefore, 
Appendix |A] such processing will yield the same value of the 
LLR received at each stage k. This accumulated sum is then 



compared to two thresholds, i^^^ and tt^' , such that when 
it is greater than tl, , Hi is declared, and if it is smaller 
than then iJp is declared. Otherwise, if it does not cross 
either thresholds, the fusion center continues to accumulate the 
LLR's from the next ranked sensors. For all the stages with 
the exception of the K^^ stage, i.e., for \ < k < K — 1, these 
thresholds are given by 

r(L) , TTo 



r(H) 



1 - TTo 

{K~k) \yk\ ~ (M-K) max p{y) 
0<v<\yk\ 



tl =log 



(10) 



1 - TTo 

+ {K-k) \yk\- (Af - K) min p {y) 

0<v<\yk\ 



where for LLR Y 



p (y) = log 



= Ids 
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r_jY{w\Hi)dw 

JlyfY{w\Ho)dw 



(11) 



is a correction term to (|6) that appears at each stage k to 
account for the probability that all the other {M — K) sensors 
would have LLR values less than \yk\- Since it is not a 
monotonic function, we need to span over the range from 
< y < \yk\ and choose the maximum and the minimum 
values of p{y). At the end of the slot, i.e., k — K, 



f(H) 



f(L) 



log 



1 - TTo 



- (M - K)p(yK) (12) 



We now prove that using the thresholds in (fTOt ensures that 
the average probability of error achieved by the sequential 
scheme using the above thresholds is the same as the minimum 
average probability of error attained by a MAP detector oper- 
ating on K out of M measurements with the highest absolute 
magnitude. Consider the situation in which the accumulated 



LLR exceeds t 
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J2 ^ 



r(H) 



(13) 



Note that 
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m — l 
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Since \yk\ > IvkI, minimizing over the domain < y < \yk\ 
achieves at least the same minimum when the domain < 



min p (y) = min log 

0<y<\yk\ 0<y<\yk\ 



< min log 
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Hence, when X]m=i exceeds t^^', it also exceeds 

(M - K) log ■ 
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This implies that 
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Yym + {M-K)\og 
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> log 
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which is the MAP rule for deciding Hi based on a block of 
best K among M sensors. The case of deciding Ho can be 
proved similarly. The benefit reaped from using the sequential 
scheme is that the average number of observations needed 
to make a decision is close to -y, which has been showed 
via simulations. In the next section, we adopt a method for 
determining the thresholds that allows a tradeoff between 
the average probability of error and the speed of reaching a 
decision concerning primary activity. 

IV. Proposed Ranked Sequential Scheme 

In this section, we use dynamic programming to calculate 
the thresholds for sequential detection. Specifically, we employ 
the backward induction technique which provides the optimal 
action to be taken in order to minimize the overall decision 
cost ll33l . In the analysis below we assume that the fusion 
center knows the statistics of all detectors, which are updated 
periodically over the common control channel and change 
relatively slowly over time as mentioned in Section |II] 

Since we adopt the ordered transmission scheme and assume 
perfect reporting, if an LLR value is received, all subsequent 
LLR's would have values with a lesser magnitude. In other 
words, if the value of the LLR received at stage fc — 1 is equal 
to a, then the probability Pr(|y[™l| > |a|) = 0, Vm > k. 
Recall that Ym is the LLR from the m*'* sensor, 1 < m < M, 
whereas yt™l is the LLR with the m*'* highest magnitude 
that is transmitted to the fusion center at the m*'* stage with 
m = 1,2, ..K. 



Let TTfc be the probability of the channel being idle at stage 
k given the sequence of observations yl^l = — 
2/2, ..F''"'' = yk- ProbabiUty nk can be obtained recursively 
as follows: 

TTfc = Pr(i?o|yi,2/2, --yk) = 

/yW|yHi..yi'-ii,go {yk\yi-yk-i, Ho)Pt {Ho\yi..yk-i) 
E /y,M|y[ii..F[fc-ii,H, (yfc|2/i--2/fe-i,^r)Pr(-ffr|yi--yfe-i) 

r=0,l 

E fYW\Yl''-i],H^ {yk\yk-i,Hr) (r + (1 - 2r) 7rfc_i) 

r=0,l 

(14) 

where in the last step, we use the Markovian property in- 
duced by ordered transmissions that Y^'^^ is independent of 
__Ylk-2] gjygjj y[fc-i] jjjjjj either hypothesis (Theo- 
rem 2.4.3, [l32l). 

What is needed for backward induction is the conditional 
probability of fI™! given under both Hq and Hi. 

Assume that the probability density function of LLR value Yk 
under hypothesis H (either Hq or Hi) is given by fy^ {y\H), 
which is a scaled and shifted chi-square distribution with 
N degrees of freedom as mentioned in Section |II] Define 
l3k,bM - Pr{[|>fc| > \b\] I H}. The value of l3k,b,H can be 
readily computed knowing the likelihood functions of Yk- 
For non-identical sensors, the joint distribution of and 
conditioned on hypothesis H when — I7I < a < \j\ 
and m > 2 is given by: 

M M 



k=lj^k 



(15) 



where S'™^- ^ is a set of m 



2 sensors chosen from the 
M sensors with sensors k and j excluded. The number of 
sets S^'J^ is (*(l22). Parameter is i equal to unity when 



V e S^j and zero otherwise. ^[m-i] (a, 7 | H) 







if \a\ > Note that ( flSl l is derived as follows. The joint 
distribution between and involves a summation 

over all possible sensor pairs. Since, given a particular hypoth- 
esis, the LLR's from the local sensors are independent, we 
get the product fy,, {a \ H) fy. (7 | H). For m > 2 and due 
to the ordered nature of LLR transmission, we have to — 2 
sensors with absolute LLR values exceeding = 'y. 

The rest of the sensors should have absolute LLR values that 
are less than y[™l = a. In the case of identical sensors, 
fy, iy\H)^ fy (y I H) and ^k.b.H = Pb^H, 

/^[H,y[™-il (a, 7 I H)^M{M-l)- 

fy (a I H) fy (7 I H) (/57,«)™"' (1 - Pc.h)''-" 

(16) 

In order to obtain the conditional distribution 
(a|7, H), we need the distribution fyim-i] (7|-ff) 



because 

fy[^]\Y\^-i]{a\l,H) ^ — — (17) 

Distribution /y[m-ij {'^\H) can be readily obtained as 

/yl™-ll (7|i/) = 
M 

5^-2 ^5^fe 



(18) 



where S*^ is a set of to — 2 sensors chosen from the M 

k 

m-2 



sensors with sensor k excluded. The number of sets S] 
Parameter Xy is equal to unity when v G S. 



IS 



\m-2) 



and zero otherwise. In the case of identical sensors, jSk^^.H — 



y[m-l 



Mfy {l\H) (1 - P,.Hf-^' ^''^ 

We construct now the cost-to-go function which is used to 
obtain the optimal policy in optimal stopping problems |34|. 
The optimal policy here is comprised of the optimal thresholds 
to be used at each stage to determine whether to stop and 
make a decision regarding the channel occupancy, or continue 
obtaining more LLR measurements. Assume that the cost of 
deciding / at stage k when j is the true hypothesis is A^^ , and 
the cost to continue taking observations is c > 0. Define J^^' 
as the minimum cost-to-go at stage k of the finite horizon K. 



Jr^(7rfe,r['=l) = minjAgo tt^ + A^i (1 
Aio ^fc + ^11 (1 ~ ^fe), 



-E 



y[fc + l]|y[fc] 



[k] 



(20) 



where tt^ and tt^+i are related through the expression in (Il4l 
with k replaced by fc + 1. The first term in (l20t is the cost of 
deciding Hq, whereas the second term is the cost of deciding 
Hi. These correspond to the channel decided to be free or 
busy, respectively. The third term in (l2Qt is interpreted as the 
expected cost when the fusion center decides that it should 
continue taking more observations from local sensors. The 
conditional expectation is given by 



E 



y[fc + l]|Y[fc] 



4^j(7r.+i,r[^+ii)|y[^-: 



yW| (21) 



and 



+ /y[*+ii|YW,H, (l-TTfc) 



(22) 



The parameter c represents the tradeoff between the time 
taken till a decision is made and the average probability of 
error As c increases, the fusion center becomes more likely 
to favor one of the two hypotheses in a shorter time using only 
a few LLR's from the cognitive detectors. At the last stage, 
i.e., when k — K, the fusion center has two choices only; to 
declare Ho or Hi. This allows backward induction from the 
last stage 



jjf ^(^K, rl^l) = min |a^o + Xoi (1 - ^k) , 



^10 T^K + Af;^ (1 - -kk 



K 



(23) 



Note that there is no actual dependence of J^'' on yl^l 
because there is no more sampling after the K*^^ stage. 
j''j^}^{-kk-i, y[^~^l) can be obtained using ( |20] i for all values 
of yl^^^l. The process can then be repeated to obtain the 
thresholds and optimal decisions. It is clear that this is a 
computationally extensive task due to the dependence on 
last observation. In particular, the thresholds at each stage 
would be a function of the value of the LLR sent in the 
previous stage. To reduce the complexity, we propose here 
to use the distributions of Y^^^ without conditioning. This 
makes the problem considerably more manageable. We have 
verified through simulations that at least for a few number of 
sensors dropping this dependency has a negligible effect on 
the objective function. Dropping the explicit dependence on 
yl'^l, the dynamic program now acquires the form 



J, 



'f\'Kk) = min |aJ;o vTfe + (1 - TTfc) , 

4+1 K+l) 



Aio TTfc + \\i (1 - TTfe) , 



(24) 



such that 
with 



4^i(^fe+i) = / Jk+i^^k+i)fYi>'+^] (y) dy (25) 



C + Ey[fc + lJ 



(y) = 7rfe/y[fc+ij(y I Hq) + (1 - 7rfc)/y[fc+ij(y | Hi) 
, , , (26) 

7rfc/y[fc+ii {y I Hq) + (1 - 7rfc)/y[fc+ij (y | Hi) 
Finally, {y \ H) can be computed using an expression 



(28) 



similar to 



M 



Note that the optimal thresholds and decisions can be 
calculated offline so long as the relevant system parameters 
are fixed. When the fusion center gets the LLR observations, it 
uses the pre-computed thresholds to decide whether to decide 
in favor of Hq or Hi, or to request the transmission of one 



more LLR value from sensors. The calculation should be 
redone when any of the parameters, such as the local SNR's 
at the cognitive detectors, change significantly. 

V. Performance Optimization 

We now provide expressions for costs A^^ introduced in 
Section |IV] Recall that X^j is the cost to decide i when 
j is true at the fc*'* mini-slot of duration r. Our objective 
is to optimize the system performance with a focus on the 
achievable primary and secondary throughput. Specifically, 
when we deal with throughput, we mean a weighted sum 
of primary and secondary throughputs where a factor uj, 
< w < 1, is multiplied by primary throughput and 1 — ui 
is used to weight the secondary throughput. The closer ui 
to unity, the more emphasis we put on the primary rate. 
This implies more protection for the primary link against 
interference and service interruption. Parameter u should be 
chosen by the primary network to satisfy the transmission and 
rate requirements of primary users. 

Define r/p and r/s to be the probability of correct reception of 
the primary and secondary signals in the presence of receiver 
noise only, respectively. Similarly, define 6p and Sg to be the 
probability of correct reception of the primary and secondary 
signals in the presence of noise in addition to interference from 
the other user due to concurrent transmission. Given that Rp 
and Rs are the primary and secondary rates of transmission, 
respectively, the costs of different decisions are then given by 

(ts - Tn - kr) 



An 



est 



(1 - Uj) RsT]s 

{ts - Tat - kr) 



(29) 



where e^t is the transmission cost or expended energy for the 
secondary terminal if it transmits for the whole slot duration. 
This cost is given in rate units to be compatible with the first 
term in 



■^01 



ujRp Sp- {I -uj) Rsds 



{ts - Tn - kr) 



ept 



est ("--"^-^^") +P 



(30) 



where ept is the transmission cost or expended energy for the 
primary terminal expressed in units of rate, and P is a penalty 
term for collision with primary user 



Ajo - Lf 



(31) 



where L[ is the cost of losing the opportunity to access the 
spectrum when the channel is idle. This term can be positive 
to account for other negative consequences for remaining 
idle than the throughput being zero. The failure to transmit 
may increase the transmission delay for a delay-sensitive 
application, or cause packet loss due to queue overflow, etc. 



A^i = -ujRpTjp + e. 



pt 



(32) 



where Lb is the cost of losing the opportunity to access the 
spectrum when the channel is busy. All the costs e^t, ept, L{, 
ib and P are nonnegative. 



The first term of expression (129) shows the gain of the 
secondary user when an idle channel is correctly detected. It 
depends on the weight assigned to the secondary user, the time 
remaining from the time slot after making the decision, and 
the probability of correct detection in the absence of primary 
interference. Similarly, the first term of (|32| is the primary 
throughput in the case of correct detection by the secondary 
users. The first two terms in (130) represent the possibility of 
correct reception in spite of the occurrence of a collision due 
to mis-detection. The interference survival probabilities Sp and 
5s would be typically small in value. 

Recall that parameter c > is the cost of taking one more 
observation. Given the aforementioned performance metrics, 
c can be understood as the cost, in units of rate, of the 
transmission phase in which the secondary fusion center 
solicits another LLR measurement and receives it from the 
appropriate local sensor 

A. Throughput Maximization 

If we adopt the weighted sum throughput maximization 
objective, then setting all the costs, including c, to zero except 
for the throughput terms, declaring Hi at any stage in the slot 
except for the last stage might cause losing the opportunity to 
access the spectrum by the cognitive radio device. Instead, 
the fusion center should solicit more observations, for a 
possible declaration of a free channel before the end of the 
slot, which consequently increases the normalized secondary 
throughput. Hence, rather than two thresholds as in classical 
sequential detection, we can just have one threshold and two 
decisions: to declare Hq and assign one of the cognitive 
terminals to transmit over the probed channel, or to ask for 
one more observation if less than K LLR observations have 
been collected. Consequently, the objective cost-to-go at stage 
k can be expressed as 



ji'^H^k) = min \ TTfc + (1 - TTfc) , 



(33) 



where Aqq and A^^ are given by 



and (l30t with est = 
Cpt = 0. The above expression is valid for k < K. Al the last 
stage where k ^ K, we have 



J^k\'^k) = min <j A^o ttk + (1 " '^k) , 



T^K + Afi (1 — ttk] 



(34) 



where Xf-Q and A{^ are given by (l3Tl i and ( |32] | with k — K, 
Gst = Gpt = and = L{ — 0. We actually can show 
that starting with the two-threshold case, if we set c = 0, 
— and Lh = Lr = 0, the two-threshold 



est 



--pt 



case would converge to the one-threshold case as the lower 
threshold would always be set to except at the very last 
stage. The proof is given in Appendix iB] Note that if there are 
multiple channels the two-threshold case would not necessarily 



converge to the the one-threshold case because declaring Hi 
would be equivalent to switching to another channel in order 
to search for a transmission opportunity. 

VI. Fading Channels Between Sensors and Fusion 
Center 

In this section, and instead of the previously assumed perfect 
reporting channels, we consider fading channels between the 
local detectors and the fusion center. The statistical distribution 
of the channel gain between the i*'' sensor and the fusion 
center is given by fg^ (x), possibly different for different 
sensors. We assume that the instantaneous values of the 
channel gains are known by the sensors and the fusion center 
These values are fixed over a duration Tc which is multiples of 
the primary slot duration r^. Note that if the channel between 
a sensor and a fusion center is in deep fade, then even if the 
sensor has a reliable observation, it may not be able to transfer 
it correctly to the fusion center within the mini-slot duration 
r. For the transmission from the i*'* sensor to the fusion center 
to be decodable, the rate of transmission should be lower than 
the link capacity which, for a fixed and known channel gain 
gi at the transmitter and additive white Gaussian noise at the 
receiver, is given by 



(35) 



where W is the reporting channel bandwidth, af is the noise 
variance at the receiver of the fusion center, and Pi is the 
transmitted power by the i*'' sensor We consider the case 
where a local sensor transmits with a rate given by 



= W\og 1 



(36) 



where > 1 is the signal-to-noise ratio gap to capacity 
ll35l . This factor accounts for the practical limitations that 
forces transmission at a rate below link capacity for correct 
reception. Assume that a number b of information bits needs 
to be transferred to the fusion center This would include 
the quantized LLR values with high resolution to justify the 
neglect of quantization noise. The amount of time needed for 
this transfer conditioned that the fusion center would be able 
to decode the transmission correctly is given by b/r;. We 
consider that this transmission should take no longer than Tb 
which is obviously smaller than t. In other words, the z*'' 
sensor would be able to send its LLR value to the fusion 
center in time Tb or less if 

b 

< Tb (37) 



This can be written as 



Via? f^si— 



1 



(38) 



Let gi be the right-hand-side of inequality ( |38] |. The probability 
then that a sensor would report its LLR measurement to the 



fusion center is given by 



fg^ (x) dx 



(39) 



Over a period of Tc only a fraction of the local detectors 
would participate in the reporting process to the fusion center 
Since the channels are assumed to be known at the sensors, 
each sensor can decide whether it should participate in the next 
sensing epoch. The fusion center also, knowing the channels, 
can know the subset of sensors that would be involved in 
the next sensing events that span a time duration of T^. Our 
analysis in the previous sections applies but with a number 
of sensors equal to those whose channels satisfy dSSl l. The 
average number of sensors participating in the sensing is 
simply 1 ^i- The probability that the number of sensors 
participating in sensing is equal to M, where < Af < M, 
is given by 



M 



(40) 



where 5jg is a set of M sensors from M sensors. The number 
of elements in set Sj^ is (|{). Factor a;, = 1 if sensor i belongs 
to the set Sj^j and zero otherwise. For symmetric channels 
between the sensors and the fusion center, 6i ~ 6 for all i and 
the probability of M sensors participating is 



S^' (1 - S) 



M-AI 



(41) 



The average number of sensors participating in the sensing 
and reporting process is, hence, equal to MS. This means that 
the performance of the network with M sensors would, on 
average, be equal to the performance of a network with non- 
fading reporting channels, albeit with M6 sensors. 

VII. Simulation Results 

In this section, we investigate via numerical simulations 
the performance of our proposed schemes. Unless otherwise 
stated, the simulation parameters are as follows. The noise 
variance at each detector — 1 and the transmission of each 
observation takes t = 0.1 units of time. The time taken at 
the beginning of a time slot with duration = 1 to collect 
N observations is tn = 2t. This leaves K — 8 stages in 
the slot in which the fusion center takes the ordered LLR 
observations. We consider the case of equal prior probabilities 
of both hypotheses, i.e., ttq = 0.5, and equal local signal to 
noise ratio over the channels between the local sensors and 
the primary user, cr^ =2, i = 1,2, M . Each sensor takes 

= 3 samples in the duration rjv. The cost, c, to continue 
without deciding on one of the two hypotheses is set to 0.0001. 

In order to minimize the average probability of error in 
equation (|24] |. we use the decision costs: X'j^j = 1 when i j 
and Af, — when i = j. Cost Aj; — means that no cost 
is incurred if the decision at stage k is the true hypothesis. 
Moreover, to study the objective of maximizing the weighted 
sum throughput, we use the expressions (|29]l-(l32]i for the costs. 
In the simulations, the transmission costs for both the primary 



and secondary users, Cpt and e^t, are set to 0, as well as 
the penalty factor P. Furthermore, the costs of losing the 
opportunity to access the spectrum when it is idle or busy, 
Lf and L;,, are assumed to be 0. The transmission success 
probabilities are rjp ^ 1, rjs ~ 1, 5p = Q and 5s = 0. 

A. Two-Threshold Based Method 

Figure |2] shows the normalized primary and secondary 
throughputs for a weight w — 0.5 and a weight lo ~ 0.999. 
The latter case is the case of interest in the context of cognitive 
radios given the privileges of the licensed primary users. The 
normalized throughput is obtained via simulating the system 
using the optimal thresholds. If the simulation is run over Q 
time slots, the normalized secondary throughput is given by 



Q 



=1 



kqT 



(42) 



(S) 

where Iq is equal to unity if the one of the secondary 
users transmits successfully over the q*'* time slot and zero 
otherwise, and kg is the number of probed sensors in the 
g*'* time slot. The normalized primary throughput is equal 
to ^ Yl'^^i Iq^^ Rp, where Iq^^ is equal to unity when the 
primary user transmits successfully during the q*^ time slot 
and zero otherwise. It is clear from the figure that increasing 
the weight puts more emphasis on the primary throughput. The 
difference is, however, small in the case of a large number of 
sensors. For the rest of the results in this section and just for 
demonstration purposes, we use cj = 0.5. 
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2, K = 8, -Qp = 1 = ri: 
and c = 0.0001 for lj = 0.5 and u) = 0.999. In the legend, "Iry" refers 
to primary and "2ry" refers to secondary. The scheme used to generate this 
figure is dynamic programming for throughout optimization. 
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In Figure [3] it is clear that the modified scheme of f24], 
which we hereafter refer to as Modified B&S scheme, achieves 
a lower probability of error than the dynamic programming 
scheme, denoted as DP scheme, since it achieves the same 




probability of error as the MAP block detector However, as 
illustrated in Figure |4] the number of sensors involved in the 
sensing process in the dynamic programming scheme is lower 
than that in the Modified B&S scheme. The number of sensors 
actually converges slowly to unity using dynamic program- 
ming. Increasing the number of sensors in the network causes 
the ranked LLR observations to be more informative about the 
status of the channel. Therefore, when the DP scheme is used, 
the decision can be made after taking observations from one 
sensor on average, given that it has the maximum absolute 
LLR. However, for the Modified B&S scheme, the sensing 
time increases as the number of CR's in the network increases. 
This follows from the fact that the greater the number of sen- 
sors in the network, the greater the distance between the two 
thresholds of comparison in (fTol i causing more observations to 
be taken. Thus, the dynamic programming scheme represents a 
tradeoff of the sensing time with the error probability, via the 
c parameter The performance enhancement increases when 
the objective becomes to maximize the achievable weighted 
sum throughput as is evident from Figure H] Figure |5] demon- 
strates these results on the normalized secondary throughput, 
assuming unity primary and secondary transmission rates Rp 
and Rs- For the normalized secondary throughput it is obvious 
from the simulation results that it converges to approximately 
TTo — ^^^T^^ in the DP scheme. This is the throughput for 
the hypothetical case of using just one "perfect" sensor that 
reveals that true state of the channel. 

Using Modified B&S scheme, it is noted that when the local 
sensors' SNR is high, the number of probed sensors is almost 
upperbounded by shown in Figure |6] For low SNR 

regime, the number of probed sensors is close to though 
may exceed it when the SNR is low depending on the value of 
K. If we use conditional distributions for the measurements at 
local sensors for which the correction term p (y) in equation 
([Tol l is zero, it can be shown analytically following a proof 



in ll24l that the maximum number of probed sensors is -y. 
This is the case for example when the local measurements 
have conditional distributions that, in contrast with those in 
are Gaussian with different means and the same variance. 
The simulation results for such "shift in mean" case are also 
provided in Figure |6] 

Taking into consideration the fading effect in the channels 
between the sensors and the fusion center as explained in 
Section IVII only a subset of the total number of local de- 
tectors becomes involved in the sensing and reporting process 
according to the fading coefficients. This means that the fusion 
center takes longer time to detect the presence or absence of 
the primary user. Figure |7] shows the effect of fading, which is 
to slightly increase the average number of probed sensors. The 
parameters used to produce this figure are 6 = 20, = 50 
kHz, Th = ^ — 0.5 msec, = 2, and ^ = 5. Assuming that 
the channel gains between the sensors and the fusion center are 
i.i.d and are exponentially distributed with unity mean, we can 
easily find that S is equal to 0.743. Therefore, slightly less that 
three quarters of the number of sensors participate, on average, 
in the process of spectrum sensing. The observation that fading 
has a minor effect is just because of the simulation parameters. 
Note that in Figure |4] the average number of sensors is already 
less than 2 even for a relatively small M. 

B. One-Threshold Based Method 

As mentioned in Section |V] instead of applying the two- 
threshold based scheme to the accumulated LLR observations 
at the fusion center, it can be sufficient to perform the 
comparison using one threshold only, when the sensing cost, c, 
is set to 0. In fact, this requires consuming more mini-slots in 
sensing the channel, till a decision is made about the channel 
being idle. Otherwise, sensing continues till the end of the 
slot, declaring a busy channel. 

Figure [8] shows the convergence of the two-threshold 
scheme to the one-threshold scheme at zero sensing cost, for 
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Fig. 5. Normalized Weighted Sum Throughputs for both DP schemes and the 
Modified B&S scheme, vs. M, N = 3, cr?. = 2, = 8, and c = 0.0001. 



Fig. 7. Average sensing time vs. M, with N = 3, a-^ . = 2, K = 8, using 
the DP scheme for en'or optimization under fading between the local sensors 
and the fusion center 
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Fig. 6. Number of sensors involved in sensing, for high SNR and low SNR 
versus different values for K, with AI = 100, N = 3 and tr^. = 2 for low 
SNR and a^. =50 for high SNR. The "shift in mean" case refers to using 
conditional distributions for local sensor measurements that, in contrast with 
^2)' 3re Gaussian with the same variance and different means. 



the M = 10 case. As c decreases, the lower threshold is 
reduced at all stages. When c = 0, the lower threshold is 
at all stages except for the last one at which a decision 
must be made. The fusion center therefore continues to take 
observations till the end of the slot if there is no sufficient 
statistics to declare Hq. At any stage up to fc = 8, the fusion 
center has only two choices; to declare Ho or to probe more 
sensors, which is the main idea of the one-threshold scheme. 

To investigate the effect of the cost parameter c on the 
sensing time, we consider the dynamic programming scheme 
with decision costs set for minimizing the average probability 
of error. Figure |9] shows that at c = the fusion center does 
not make a decision till the end of the slot. Increasing the cost 



Fig. 8. Thresholds of comparison for different costs vs. stage index k, with 
K = S, M = 10, = 2 and TV = 3. 

gives the fusion center the chance to make a decision before 
the end of the slot allowing for secondary transmission. 

VIII. Conclusion 

In this paper, we considered ordering the transmissions from 
cognitive users to the decision fusion center according to the 
information they carry about the status of the primary user 
Restricting the transmissions to the highly informative set 
of observations, i.e., the LLR's with the highest magnitude, 
increases the time remaining in the time slot after making the 
decision, thereby boosting the throughput while protecting the 
primary signal. We devised a sequential scheme that achieves 
the optimum error probability of the MAP block detector, but 
makes the decision faster. We also devised a scheme based 
on dynamic programming that allows a trades-off between 
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the average probability of error and the average sensing 
time. Simulation results have demonstrated that using dynamic 
programming approach to compute the thresholds required to 
make the decision at the fusion center yields a considerable 
enhancement in the performance in terms of reduced delay 
and increased secondary throughput. 

Future work needs to address the situation of multiple 
primary channels. Also of interest is the design of the multiple 
access scheme that allows the sensor with the highest LLR in 
magnitude to seize the common control channel and transmit. 
Another front that needs further investigation is the channel 
access scheme and how the discovered spectrum opportunities 
are to be distributed among the secondary users. 



Appendix A 



For the optimal probability of error MAP detector, using 
the best K out of M LLR magnitudes, the joint distribution 
of K observations under hypothesis H (either Hq or Hi) is 
given by fYi..YK\H {yi-,V2,-yK \ H). Since the sequence of 
observations forms a Markov chain (Theorem 2.4.3, [32] ) 



fY,^Y2...YK\Hiyi.y2, -yx I h) = fym (^w = ■ 
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For symmetric sensors, the conditional distribution of the 
sensor of rank j receiving an LLR value fI^I = a given that 
the sensor of rank j — 1 has already received an LLR value 



= 7 is given by 
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with l3x,H defined as Pr{|F| > \x 
variable Y is an LLR observation. Hence 
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Then, the likelihood ratio of the best K out of M LLR's for 
a block detector can be written as 

fY^..YK\HAyi,y2,-yK \ Hi) 
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fY{yi\Ho)..fY{yK\Ho){l-f3y,,Ho) 

We now show that the sequential algorithm may operate on 
the received LLR's directly rather than computing their LLR's. 
This requires that the LLR is a strictly monotonic function of 
the observation. Consider observation y with Ik given by 
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For a monotonic g function, conditioned on H (either Hi or 

Ho), 
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But the right-hand-side of (|49] l is given by 

fY {y I Hi) 



(48) 



(49) 



log- 



fY (y I Ho) 



Hi] 



fY ig-' (Ur) I Ho) 



(50) 



llr 



Therefore, 



lot 



llr 



(51) 



/ll/? (llr I -ffo) 

If the LLR monotonicity assumption is not satisfied, the 
thresholds in (fTOl l can be modified as follows: 
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The proof that these thresholds achieve the same average 
probability of error if the best K out of M sensors are used 
follows the same outline as that presented in Section (llllb . We 
focus here on the lower threshold. A decision in favor of Ho 
is made if the metric X)m=i log /yCy^igo) ^^'^ below If 

Z^m=1^0g f^(y^\H„) ^ ' 



, fy {ym\Hi, 
2^ log ^ ,^ ^„ , < log 



m— 1 



log 



fy iVrnlHo 



1 - TTo 



1 - TTo 



, /y (ym|gi) 
'/y (ym|i?o) 



Then 



Vlog-p-7 < log 



7^0 



< log 

< log 
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- TTO 
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— max p (u) < 

0<!/<|afc| 



(M — K) max p (y) 

0<V<\Vk\ 



(M-K) max p{y) 

0<y<\yk\ 

(M-K) umx p{y) 
0<y<\vK\ 



{M - K) log 



1-A 



yK,Hi 



1 - PyK.Ho 

(54) 



which is the MAP rule for a decision based on a block of the 
K most reliable measurements among M observations. 

Appendix B 

In this appendix we prove that when the throughput terms 
are preserved while all other costs, including parameter c, are 
set to zero, the lower detection threshold for the sequential 
procedure becomes always zero except for the last stage. This 
means that, from a throughput point of view, the optimal 
decisions based on the observations are either to declare Ho 
and then select a cognitive terminal for channel access, or to 
continue taking more samples so long as less than K LLR 
values have been gathered. 

We start by proving the concavity of function jj^ (vTfe) using 
induction. At the last stage 



mm < — r. 



1 - 



Tat + Kt 



-Tp (1 - TYk) 



(55) 



where r 



[\ — Lo) Rs and — ujRp. For k < K, 



Jk i'^k) = min <^ - 1 - 



TTfc, 



-rp (l-TTfe), c + *fc(7rfe) \ (56) 



where {nk) is given by 



*fc (TTfc) 



J, 



K 

k+1 



TTfc/o 



T^fc/o + (1 - 7rfc)/i 



TTfe/o + (1 - TTfc) fl 



^y[fe+l] 



(57) 



Note that we use /o in place of /y[fc+i] (y | Hq) and /i in 
place of /y[fc+ii(y | Hi) to simplify notation. Parameter c is 
kept in expression (|56] | and is set to zero after the concavity 
of (TTfc) is established. 

If ^'fc (TTfc) is concave, then c + ^'fc (TTfc) is concave and, 
consequently, (ttj,) is concave because (ttj.) is the 



minimum of three concave terms II361 . We now prove the 
concavity of 5'^ (tt^) under the assumption that J^_^i ('"'fc+i) 
is concave. Function ^fc (Tr^) is concave if it satisfies the 
following inequality 

At*fc (x) + (1 - k) (z) < ^-fe {kx + (1 - k) z) (58) 

Vk g [0, 1] |l36l. Let 

e = [a;/o + (1 - a:) fl] + [^/o + (1 - ^) /i] (59) 

The right-hand-side of (ISST i then can be written as 

tK f ^fo 



1 



x/o + {l-x) /ij Jf+i 
z/o + (l-z)/iljf+i 



2^/0 + (1 - X)fl 

zfo 

zfo + {1- z)h 



(60) 



Noting that f [xfo + {I - x) h] + ^ [zfo + {I - z) h] = 1 
and using the assumption that J^j^^ is concave, we obtain 



K*fe (a;) + (l-K)*fc (z) < 



Jk+i ( -^/o + 



1 - K 



Zfo) edrl'^+H 



(61) 



Factor e can be expressed as e = e/o + (1 — e)/i, where 
e = + (1 — k) z. Using this, inequality ( |6T| l becomes 

K-^k {x) + (1 - k) *fe {z) < 



e7o + (1 - e) /i 

(e) = */c + (1 - k) z) 



e7o + (1 - e) fl 



(62) 



Therefore, 5*^: (tt^) is concave in tt^ assuming that 
•-^k+i (""fc+i) Is concave. It is evident from ( |55] | that ('''if ) 
is concave since it is the minimum of two affine terms. By 
induction both '^k {T^k) and {iTk) are concave in Tr^ for 
k<K. 

We now study the value of (irk) at tt^ = 0. It is clear that 
Jk (0) = -Tp. Since (0) = / (0) /idFl^l = -rp. 



i (0) = min{0, —rp,c — Vp} = —Vp for c > 0. It 

: — r„ for 



J'^ (0) 



then 

is straightforward to show that ^ffc (0) 
l<k <K. 

When TTfc = 0, the minimum of the three terms of (|56] | is 
—Tp obtained from the second term — rp (1 — Tik) for c > 0. 
The value of the derivative of jj^ {-Hk) at vTfc — is equal to 
the derivative of the term —Vp (1 — Tik) at — 0, which is 
equal to Vp. The derivative of ^I^fe (tt^) is given by 



diTk 



dJk+i dir. 



fe+i 



+ Jk+iiT^k+i) fo-f: 



TTfe/o + (1 - TTfc) /l 
^y[fe+ll 



^y[fe+l] 



(63) 



smce TTfc+i 

is straightforward to show the 
Hence, 



when tt^ = and it 



at TTj, 



is equal to 



7rfc=0 



[/o - /i] drl'^+H - rp 



(64) 



Since '^k is concave, it satisfies the following inequality [|36] 



*fe (^) < *fe (a;) + 



dx 



(65) 



For 

-r. 



a; 
(1 



: and z = TTfc, vpfe (tt^) < -r^ + TpTTk = 
TTfc). This means that when c ~ 0, the third term in 
is always less than or equal the second term — Tp (1 — Tr^). 
Hence, the cost of getting one more sample is always less than 
or equal to the cost of declaring Hi. In other words, when 
c — 0, the lower decision threshold is zero at all the stages 
with the exception of the last one because it is less costly to 
continue sampling rather than declaring Hi. 
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